SEQUENCE LISTING



(1) GENERAL INFORMATION: 



(i) APPLICANT: 

(A) NAME: Jarl Wikberg 

(B) STREET: Trillvaegen 13 

(C) CITY: Umea

(E) COUNTRY: Sweden

(F) POSTAL CODE (ZIP): 905 92 Umea

(A) NAME: Vijay Chhajlani 

(B) STREET: Stigbergsvaegen 

(C) CITY: Uppsala 

(E) COUNTRY: Sweden 

(F) POSTAL CODE (ZIP): 752 42 Uppsala

(ii) TITLE OF INVENTION: Human Melanocyte stimulating hormo 
(iii) NUMBER OF SEQUENCES: 20

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sterne, Kessler, Goldstein & Fox 

(B) STREET: 1100 New York Ave., N.W. 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: USA 

(F) ZIP: 20005 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 08/387,805 

(B) FILING DATE: 21-FEB-95 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/DK93/00273

(B) FILING DATE: 20-AUG-93 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DK 1046/92 

(B) FILING DATE: 21-AUG-92 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DK 1118/92 

(B) FILING DATE: 10-SEP-92 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DK 0528/93

(B) FILING DATE: 05-MAY-93 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: Cimbala, Michele A.

(B) REGISTRATION NUMBER: 33,851

(C) REFERENCE/DOCKET NUMBER: 1102.0160000

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202) 371-2600

(B) TELEFAX: (202) 371-2540 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (cDNA)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 169..1122

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GGAGAGGGTG TGAGGGCAGA TCTGGGGGTG CCCAGATGGA AGGAGGCAGG CATGGGGGAC 60

ACCCAAGGCC CCCTGGCAGC ACCATGAACT AAGCAGGACA CCTGGAGGGG AAGAACTGTG 120

GGGACCTGGA GGCCTCCAAC GACTCCTTCC TGCTTCCTGG ACAGGACT ATG GCT GTG 177
Met Ala Val
1

CAG GGA TCC CAG AGA AGA CTT CTG GGC TCC CTC AAC TCC ACC CCC ACA 225
Gln Gly Ser Gln Arg Arg Leu Leu Gly Ser Leu Asn Ser Thr Pro Thr
5 10 15

GCC ATC CCC CAG CTG GGG CTG GCT GCC AAC CAG ACA GGA GCC CGG TGC 273
Ala Ile Pro Gln Leu Gly Leu Ala Ala Asn Gln Thr Gly Ala Arg Cys
20 25 30 35

CTG GAG GTG TCC ATC TCT GAC GGG CTC TTC CTC AGC CTG GGG CTG GTG 321
Leu Glu Val Ser Ile Ser Asp Gly Leu Phe Leu Ser Leu Gly Leu Val
40 45 50

AGC TTG GTG GAG AAC GCG CTG GTG GTG GCC ACC ATC GCC AAG AAC CGG 369
Ser Leu Val Glu Asn Ala Leu Val Val Ala Thr Ile Ala Lys Asn Arg
55 60 65

AAC CTG CAC TCA CCC ATG TAC TGC TTC ATC TGC TGC CTG GCC TTG TCG 417
Asn Leu His Ser Pro Met Tyr Cys Phe Ile Cys Cys Leu Ala Leu Ser
70 75 80

GAC CTG CTG GTG AGC GGG AGC AAC GTG CTG GAG ACG GCC GTC ATC CTC 465
Asp Leu Leu Val Ser Gly Ser Asn Val Leu Glu Thr Ala Val Ile Leu
85 90 95

CTG CTG GAG GCC GGT GCA CTG GTG GCC CGG GCT GCG GTG CTG CAG CAG 513
Leu Leu Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val Leu Gln Gln
100 105 110 115

CTG GAC AAT GTC ATT GAC GTG ATC ACC TGC AGC TCC ATG CTG TCC AGC 561
Leu Asp Asn Val Ile Asp Val Ile Thr Cys Ser Ser Met Leu Ser Ser
120 125 130

CTC TGC TTC CTG GGC GCC ATC GCC GTG GAC CGC TAC ATC TCC ATC TTC 609
Leu Cys Phe Leu Gly Ala Ile Ala Val Asp Arg Tyr Ile Ser Ile Phe
135 140 145

TAC GCA CTG CGC TAC CAC AGC ATC GTG ACC CTG CCG CGG GCG CGG CGA 657
Tyr Ala Leu Arg Tyr His Ser Ile Val Thr Leu Pro Arg Ala Arg Arg
150 155 160

CGC GTT GCG GCC ATC TGG GTG GCC AGT GTC GTC TTC AGC ACG CTC TTC 705
Arg Val Ala Ala Ile Trp Val Ala Ser Val Val Phe Ser Thr Leu Phe
165 170 175

ATC GCC TAC TAC GAC CAC GTG GCC GTC CTG CTG TGC CTC GTG GTC TTC 753
Ile Ala Tyr Tyr Asp His Val Ala Val Leu Leu Cys Leu Val Val Phe
180 185 190 195

TTC CTG GCT ATG CTG GTG CTC ATG GCC GTG CTG TAC GTC CAC ATG CTG 801
Phe Leu Ala Met Leu Val Leu Met Ala Val Leu Tyr Val His Met Leu
200 205 210

GCC CGG GCC TGC CAG CAC GCC CAG GGC ATC GCC CGG CTC CAC AAG AGG 849
Ala Arg Ala Cys Gln His Ala Gln Gly Ile Ala Arg Leu His Lys Arg
215 220 225

CAG CGC CCG GTC CAC CAG GGC TTT GGC CTT AAA GGC GCT GTC ACC CTC 897
Gln Arg Pro Val His Gln Gly Phe Gly Leu Lys Gly Ala Val Thr Leu
230 235 240

ACC ATC CTG CTG GGC ATT TTC TTC CTC TGC TGG GGC CCC TTC TTC CTG 945
Thr Ile Leu Leu Gly Ile Phe Phe Leu Cys Trp Gly Pro Phe Phe Leu
245 250 255

CAT CTC ACA CTC ATC GTC CTC TGC CCC GAG CAC CCC ACG TGC GGC TGC 993
His Leu Thr Leu Ile Val Leu Cys Pro Glu His Pro Thr Cys Gly Cys
260 265 270 275

ATC TTC AAG AAC TTC AAC CTC TTT CTC GCC CTC ATC ATC TGC AAT GCC 1041
Ile Phe Lys Asn Phe Asn Leu Phe Leu Ala Leu Ile Ile Cys Asn Ala
280 285 290

ATC ATC GAC CCC CTC ATC TAC GCC TTC CAC AGC CAG GAG CTC CGC AGG 1089
Ile Ile Asp Pro Leu Ile Tyr Ala Phe His Ser Gln Glu Leu Arg Arg
295 300 305

ACG CTC AAG GAG GTG CTG ACA TGC TCC TGG TGAGCGCGGT GCACGCGCTT 1139
Thr Leu Lys Glu Val Leu Thr Cys Ser Trp
310 315

TAAGTGTGCT GGGCAGAGGG AGGTGGTGAT ATTGTGTGGT CTGGTTCCTG TGTGACCCTG 1199

GGCAGTTCCT TACCTCCCTG GTCCCCGTTT GTCAAAGAGG ATGGACTAAA TGATCTCTGA 1259

AAGTGTTGAA G 1270

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Val Gln Gly Ser Gln Arg Arg Leu Leu Gly Ser Leu Asn Ser
1 5 10 15

Thr Pro Thr Ala Ile Pro Gln Leu Gly Leu Ala Ala Asn Gln Thr Gly
20 25 30

Ala Arg Cys Leu Glu Val Ser Ile Ser Asp Gly Leu Phe Leu Ser Leu
35 40 45

Gly Leu Val Ser Leu Val Glu Asn Ala Leu Val Val Ala Thr Ile Ala
50 55 60

Lys Asn Arg Asn Leu His Ser Pro Met Tyr Cys Phe Ile Cys Cys Leu
65 70 75 80

Ala Leu Ser Asp Leu Leu Val Ser Gly Ser Asn Val Leu Glu Thr Ala
85 90 95

Val Ile Leu Leu Leu Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val
100 105 110

Leu Gln Gln Leu Asp Asn Val Ile Asp Val Ile Thr Cys Ser Ser Met
115 120 125

Leu Ser Ser Leu Cys Phe Leu Gly Ala Ile Ala Val Asp Arg Tyr Ile
130 135 140

Ser Ile Phe Tyr Ala Leu Arg Tyr His Ser Ile Val Thr Leu Pro Arg
145 150 155 160

Ala Arg Arg Arg Val Ala Ala Ile Trp Val Ala Ser Val Val Phe Ser
165 170 175

Thr Leu Phe Ile Ala Tyr Tyr Asp His Val Ala Val Leu Leu Cys Leu
180 185 190

Val Val Phe Phe Leu Ala Met Leu Val Leu Met Ala Val Leu Tyr Val
195 200 205

His Met Leu Ala Arg Ala Cys Gln His Ala Gln Gly Ile Ala Arg Leu
210 215 220

His Lys Arg Gln Arg Pro Val His Gln Gly Phe Gly Leu Lys Gly Ala
225 230 235 240

Val Thr Leu Thr Ile Leu Leu Gly Ile Phe Phe Leu Cys Trp Gly Pro
245 250 255

Phe Phe Leu His Leu Thr Leu Ile Val Leu Cys Pro Glu His Pro Thr
260 265 270

Cys Gly Cys Ile Phe Lys Asn Phe Asn Leu Phe Leu Ala Leu Ile Ile
275 280 285

Cys Asn Ala Ile Ile Asp Pro Leu Ile Tyr Ala Phe His Ser Gln Glu
290 295 300

Leu Arg Arg Thr Leu Lys Glu Val Leu Thr Cys Ser Trp
305 310 315



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
GGGAATTCTG TGTGTNATCN CNGTGGACCG GTA



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
GGGGATCCGA AGAAGGGNAA CCAGCAGAGN ATGAA



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 285 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (PCR-fragment)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..285
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

TTC TAC GCA CTG CGC TAC CAC AGC ATC GTG ACC ATG CGC CGC ACT GTG 48
Phe Tyr Ala Leu Arg Tyr His Ser Ile Val Thr Met Arg Arg Thr Val
1 5 10 15

GTG GTG CTT ACG GTC ATC TGG ACG TTC TGC ACG GGG ACT GGC ATC ACC 96
Val Val Leu Thr Val Ile Trp Thr Phe Cys Thr Gly Thr Gly Ile Thr
20 25 30

ATG GTG ATC TTC TCC CAT CAT GTG CCC ACA GTG ATC ACC TTC ACG TCG 144
Met Val Ile Phe Ser His His Val Pro Thr Val Ile Thr Phe Thr Ser
35 40 45

CTG TTC CCG CTG ATG CTG GTC TTC ATC CTG TGC CTC TAT GTG CAC ATG 192
Leu Phe Pro Leu Met Leu Val Phe Ile Leu Cys Leu Tyr Val His Met
50 55 60

TTC CTG CTG GCT CGA TCC CAC ACC AGG AAG ATC TCC ACC CTC CCC AGA 240
Phe Leu Leu Ala Arg Ser His Thr Arg Lys Ile Ser Thr Leu Pro Arg
65 70 75 80

GCC AAC ATG AAA GGG GCC ATC ACC CTC ACC ATC CTG CTG GGC ATT 285
Ala Asn Met Lys Gly Ala Ile Thr Leu Thr Ile Leu Leu Gly Ile
85 90 95

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 95 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: polypeptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Phe Tyr Ala Leu Arg Tyr His Ser Ile Val Thr Met Arg Arg Thr Val
1 5 10 15

Val Val Leu Thr Val Ile Trp Thr Phe Cys Thr Gly Thr Gly Ile Thr
20 25 30

Met Val Ile Phe Ser His His Val Pro Thr Val Ile Thr Phe Thr Ser
35 40 45

Leu Phe Pro Leu Met Leu Val Phe Ile Leu Cys Leu Tyr Val His Met
50 55 60

Phe Leu Leu Ala Arg Ser His Thr Arg Lys Ile Ser Thr Leu Pro Arg
65 70 75 80

Ala Asn Met Lys Gly Ala Ile Thr Leu Thr Ile Leu Leu Gly Ile
85 90 95

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 306 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (PCR-fragment)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..306

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

TTC TAC GCA CTG CGC TAC CAC AGC ATC GTG ACG GCG AGG CGC TCA GGG 48
Phe Tyr Ala Leu Arg Tyr His Ser Ile Val Thr Ala Arg Arg Ser Gly
1 5 10 15

GCC ATC ATC GCC GGC ATC TGG GCT TTC TGC ACG GGC TGC GGC ATT GTC 96
Ala Ile Ile Ala Gly Ile Trp Ala Phe Cys Thr Gly Cys Gly Ile Val
20 25 30

TTC ATC CTG TAC TCA GAA TCC ACC TAC GTC ATC CTG TGC CTC ATC TCC 144
Phe Ile Leu Tyr Ser Glu Ser Thr Tyr Val Ile Leu Cys Leu Ile Ser
35 40 45

ATG TTC TTC GCT ATG CTG TTC CTC CTG GTG TCT CTG TAC ATA CAC ATG 192
Met Phe Phe Ala Met Leu Phe Leu Leu Val Ser Leu Tyr Ile His Met
50 55 60

TTC CTC CTG GCG CGG ACT CAC GTC AAG CGG ATC GCG CTC TGC CCG GGG 240
Phe Leu Leu Ala Arg Thr His Val Lys Arg Ile Ala Leu Cys Pro Gly
65 70 75 80

CCA GCT CTG CGC GGC AGA GGA CCA GCA TGC AGG GGC GCG GTC ACC CTC 288
Pro Ala Leu Arg Gly Arg Gly Pro Ala Cys Arg Gly Ala Val Thr Leu
85 90 95

ACC ATC CTG CTG GGC ATT 306
Thr Ile Leu Leu Gly Ile
100

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Phe Tyr Ala Leu Arg Tyr His Ser Ile Val Thr Ala Arg Arg Ser Gly
1 5 10 15

Ala Ile Ile Ala Gly Ile Trp Ala Phe Cys Thr Gly Cys Gly Ile Val
20 25 30

Phe Ile Leu Tyr Ser Glu Ser Thr Tyr Val Ile Leu Cys Leu Ile Ser
35 40 45

Met Phe Phe Ala Met Leu Phe Leu Leu Val Ser Leu Tyr Ile His Met
50 55 60

Phe Leu Leu Ala Arg Thr His Val Lys Arg Ile Ala Leu Cys Pro Gly
65 70 75 80

Pro Ala Leu Arg Gly Arg Gly Pro Ala Cys Arg Gly Ala Val Thr Leu
85 90 95

Thr Ile Leu Leu Gly Ile
100



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 312 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (PCR-fragment)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..312

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

TTC TAC GCA CTG CGT TAC CAC AGC ATC GTG ACC GTG CGG CGG GCC CTC 48
Phe Tyr Ala Leu Arg Tyr His Ser Ile Val Thr Val Arg Arg Ala Leu
1 5 10 15

ACC TTG ATC GTG GCC ATC TGG GTC TGC TGC GGC GTC TGT GGC GTG GTG 96
Thr Leu Ile Val Ala Ile Trp Val Cys Cys Gly Val Cys Gly Val Val
20 25 30

TTC ATC GTC TAC TCG GAG AGC AAA ATG GTC ATT GTG TGC CTC ATC ACC 144
Phe Ile Val Tyr Ser Glu Ser Lys Met Val Ile Val Cys Leu Ile Thr
35 40 45

ATG TTC TTC GCC ATG ATG CTC CTC ATG GGC ACC CTC TAC GTG CAC ATG 192
Met Phe Phe Ala Met Met Leu Leu Met Gly Thr Leu Tyr Val His Met
50 55 60

TTC CTC TTT GCG CGG CTG CAC GTC AAG CGC ATA GCA GCA CTG CCA CCT 240
Phe Leu Phe Ala Arg Leu His Val Lys Arg Ile Ala Ala Leu Pro Pro
65 70 75 80

GCC GAC GGG GTG G--J CCA CAG CAA CAC TCA TGC ATG AAG GGG GCA GTC 288
Ala Asp Gly Val Ala Pro Gln Gln His Ser Cys Met Lys Gly Ala Val
85 90 95

ACC CTC ACC ATC CTG CTG GGC ATT 312
Thr Leu Thr Ile Leu Leu Gly Ile
100

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 104 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: polypeptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Phe Tyr Ala Leu Arg Tyr His Ser Ile Val Thr Val Arg Arg Ala Leu
1 5 10 15

Thr Leu Ile Val Ala Ile Trp Val Cys Cys Gly Val Cys Gly Val Val
20 25 30

Phe Ile Val Tyr Ser Glu Ser Lys Met Val Ile Val Cys Leu Ile Thr
35 40 45

Met Phe Phe Ala Met Met Leu Leu Met Gly Thr Leu Tyr Val His Met
50 55 60

Phe Leu Phe Ala Arg Leu His Val Lys Arg Ile Ala Ala Leu Pro Pro
65 70 75 80

Ala Asp Gly Val Ala Pro Gln Gln His Ser Cys Met Lys Gly Ala Val
85 90 95

Thr Leu Thr Ile Leu Leu Gly Ile
100

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 372 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (PCR-fragment)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..372

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

CTG TGT GTG ATC GCG CTG GAC CGG TAC ATC TCC ATC TTC TAC GCA CTG 48
Leu Cys Val Ile Ala Leu Asp Arg Tyr Ile Ser Ile Phe Tyr Ala Leu
1 5 10 15

CGC TAC CAC AGC ATC GTG ACC CTG CCG CGG GCG CCG GAA GCC GTT GCG 96
Arg Tyr His Ser Ile Val Thr Leu Pro Arg Ala Pro Glu Ala Val Ala
20 25 30

GCC ATC TGG GTG GCC AGT GTC GTC TTC AGC ACG CTC TTC ATC GCC TAC 144
Ala Ile Trp Val Ala Ser Val Val Phe Ser Thr Leu Phe Ile Ala Tyr
35 40 45

TAC GAC CAC GTG GCC GTC CTG CTG TGC CTC GTG GTC TTC TTC CTG GCT 192
Tyr Asp His Val Ala Val Leu Leu Cys Leu Val Val Phe Phe Leu Ala
50 55 60

ATG CTG GTG CTC ATG GCC GTG CTG TAC GTC CAC ATG CTG GCC CGG GCC 240
Met Leu Val Leu Met Ala Val Leu Tyr Val His Met Leu Ala Arg Ala
65 70 75 80

TGC CAG CAC GCC CAG GGC ATC GCC CGG CTC CAC AAG AGG CAG CGC CCG 288
Cys Gln His Ala Gln Gly Ile Ala Arg Leu His Lys Arg Gln Arg Pro
85 90 95

GTC CAC CAG GGC TTT GGC CTT AAA GGC GCT GTC ACC CTC ACC ATC CTG 336
Val His Gln Gly Phe Gly Leu Lys Gly Ala Val Thr Leu Thr Ile Leu
100 105 110

CTG GGC ATT TTC ACC GTC TCG TGG CGC CCC TTC TTC 372
Leu Gly Ile Phe Thr Val Ser Trp Arg Pro Phe Phe
115 120

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Leu Cys Val Ile Ala Leu Asp Arg Tyr Ile Ser Ile Phe Tyr Ala Leu
1 5 10 15

Arg Tyr His Ser Ile Val Thr Leu Pro Arg Ala Pro Glu Ala Val Ala
20 25 30

Ala Ile Trp Val Ala Ser Val Val Phe Ser Thr Leu Phe Ile Ala Tyr
35 40 45

Tyr Asp His Val Ala Val Leu Leu Cys Leu Val Val Phe Phe Leu Ala
50 55 60

Met Leu Val Leu Met Ala Val Leu Tyr Val His Met Leu Ala Arg Ala
65 70 75 80

Cys Gln His Ala Gln Gly Ile Ala Arg Leu His Lys Arg Gln Arg Pro
85 90 95

Val His Gln Gly Phe Gly Leu Lys Gly Ala Val Thr Leu Thr Ile Leu
100 105 110

Leu Gly Ile Phe Thr Val Ser Trp Arg Pro Phe Phe
115 120



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
GGGAATTCTA CGCACTGCGC TACCACAGCA TCGTG



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
GGGGATCCAA TGCCCAGCAG GATGGTGAGG GTGA



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1650 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (cDNA)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 616..1590

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

CTTGAGGAGA ATGTCGTGCA GTAGCCTTAG GAATGTGAAC ATTGGGAGAC TGGCTGGGAT 60

TTTGTAGGTT ATGAGAAGGG GACACTTATG ATATGTGAAC TTGAGCCCAG GAGAGAAGCC 120

ATAAAAAGTG AAACTGTCCT GGGCACTTGG AGGTGAGTGT CTCTCTAGTA AGATGCATGT 180

GAAAGGCCTG GGAGCTGAAA GCAAGGAGAG CAGAAGAGGC TGGTGAAGAT TCTAATCTGC 240

GTGTCCAGGG GCACTCTTCC AGGTCTCAGG AACGCAGGTC AGAATGTGCA AGCCAGCTGC 300

CGGGCACGTG GCTCACCCCT GTAGTACCAG CACTTTGGGA GGCTGAGAGA GAAGATCGCT 360

TGTGGCCAGG AGTTTGAGAC CAGACTGGGG CTTCATAGGG AGACCCTGTC TCTTAAAAAA 420

AAAAAAAAAA AAGGACTGAG TGAGCCGAGC CCAGTCCTCT CATGCACTGT GTCATTCATC 480

CCCTTTCTTA GGCTGTGTTG GTTCTAGGCT AGCTGCTGTC TTTCTTTGGT AGGCTGCTAA 540

CCTCTTTGGA TTGTGAATTT AAAACATGTT TTACAGTAAA TTTGCTGCCA AGACAAGAGG 600

TGTATTTCTC CAGCA ATG AAT TCC TCA TTT CAC CTG CAT TTC TTG GAT CTC 651
Met Asn Ser Ser Phe His Leu His Phe Leu Asp Leu
1 5 10

AAC CTG AAT GCC ACA GAG GGC AAC CTT TCA GGA CCC AAT GTC AAA AAC 699
Asn Leu Asn Ala Thr Glu Gly Asn Leu Ser Gly Pro Asn Val Lys Asn
15 20 25

AAG TCT TCA CCA TGT GAA GAC ATG GGC ATT GCT GTG GAG GTG TTT CTC 747
Lys Ser Ser Pro Cys Glu Asp Met Gly Ile Ala Val Glu Val Phe Leu
30 35 40

ACT CTG GGT GTC ATC AGC CTC TTG GAG AAC ATC TTG GTC ATA GGG GCC 795
Thr Leu Gly Val Ile Ser Leu Leu Glu Asn Ile Leu Val Ile Gly Ala
45 50 55 60

ATA GTG AAG AAC AAA AAC CTG CAC TCC CCC ATG TAC TTC TTC GTG TGC 843
Ile Val Lys Asn Lys Asn Leu His Ser Pro Met Tyr Phe Phe Val Cys
65 70 75

AGC CTG GCA GTG GCG GAC ATG CTG GTG AGC ATG TCC AGT GCC TGG GAG 891
Ser Leu Ala Val Ala Asp Met Leu Val Ser Met Ser Ser Ala Trp Glu
80 85 90

ACC ATC ACC ATC TAC CTA CTC AAC AAC AAG CAC CTA GT j ATA GCA GAC 939
Thr Ile Thr Ile Tyr Leu Leu Asn Asn Lys His Leu Val Ile Ala Asp
95 100 105

GCC TTT GTG CGC CAC ATT GAC AAT GTG TTT GAC TCC ATG ATC TGC ATT 987
Ala Phe Val Arg His Ile Asp Asn Val Phe Asp Ser Met Ile Cys Ile
110 115 120

TCC GTG GTG GCA TCC ATG TGC AGC TTA CTG GCC ATT GCA GTG GAT AGG 1035
Ser Val Val Ala Ser Met Cys Ser Leu Leu Ala Ile Ala Val Asp Arg
125 130 135 140

TAC GTC ACC ATC TTC TAC GCC CTG CGC TAC CAC CAC ATC ATG ACG GCG 1083
Tyr Val Thr Ile Phe Tyr Ala Leu Arg Tyr His His Ile Met Thr Ala
145 150 155

AGG CGC TCA GGG GCC ATC ATC GCC GGC ATC TGG GCT TTC TGC ACG GGC 1131
Arg Arg Ser Gly Ala Ile Ile Ala Gly Ile Trp Ala Phe Cys Thr Gly
160 165 170

TGC GGC ATT GTC TTC ATC CTG TAC TCA GAA TCC ACC TAC GTC ATC CTG 1179
Cys Gly Ile Val Phe Ile Leu Tyr Ser Glu Ser Thr Tyr Val Ile Leu
175 180 185

TGC CTC ATC TCC ATG TTC TTC GCT ATG CTG TTC CTC CTG GTG TCT CTG 1227
Cys Leu Ile Ser Met Phe Phe Ala Met Leu Phe Leu Leu Val Ser Leu
190 195 200

TAC ATA CAC ATG TTC CTC CTG GCG CGG ACT CAC GTC AAG CGG ATC GCG 1275
Tyr Ile His Met Phe Leu Leu Ala Arg Thr His Val Lys Arg Ile Ala
205 210 215 220

CTC TGC CCG GGG CCA GCT CTG CGC GGC AGA GGA CCA GCA TGG CAG GGC 1323
Leu Cys Pro Gly Pro Ala Leu Arg Gly Arg Gly Pro Ala Trp Gln Gly
225 230 235

GCG GTC ACC GTC ACC ATG CTG CTG GGC GTG TTT ACC GTG TGC TGG GCC 1371
Ala Val Thr Val Thr Met Leu Leu Gly Val Phe Thr Val Cys Trp Ala
240 245 250

CCG TTC TTC CTT CAT CTC ACT TTA ATG CTT TCT TGC CCT CAG AAC CTC 1419
Pro Phe Phe Leu His Leu Thr Leu Met Leu Ser Cys Pro Gln Asn Leu
255 260 265

TAC TGC TCT CGC TTC ATG TCT CAC TTC AAT ATG TAC CTC ATA CTC ATC 1467
Tyr Cys Ser Arg Phe Met Ser His Phe Asn Met Tyr Leu Ile Leu Ile
270 275 280

ATG TGT AAT TCC GTG ATG GAC CCT CTC ATA TAT GCC TTC CGC AGC CAA 1515
Met Cys Asn Ser Val Met Asp Pro Leu Ile Tyr Ala Phe Arg Ser Gln
285 290 295 300

GAG ATG CGG AAG ACC TTT AAG GAG ATT ATT TGC TGC CGT GGT TTC AGG 1563
Glu Met Arg Lys Thr Phe Lys Glu Ile Ile Cys Cys Arg Gly Phe Arg
305 310 315

ATC GCC TGC AGC TTT CCC AGA AGG GAT TAACGACAAA GTGCTCCTCT 1610
Ile Ala Cys Ser Phe Pro Arg Arg Asp
320 325

CTGTGGCTCT GTTCTCCTTT GTTTGCTCAC CTATGACAAA 1650



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Met Asn Ser Ser Phe His Leu His Phe Leu Asp Leu Asn Leu Asn Ala
1 5 10 15

Thr Glu Gly Asn Leu Ser Gly Pro Asn Val Lys Asn Lys Ser Ser Pro
20 25 30

Cys Glu Asp Met Gly Ile Ala Val Glu Val Phe Leu Thr Leu Gly Val
35 40 45

Ile Ser Leu Leu Glu Asn Ile Leu Val Ile Gly Ala Ile Val Lys Asn
50 55 60

Lys Asn Leu His Ser Pro Met Tyr Phe Phe Val Cys Ser Leu Ala Val
65 70 75 80

Ala Asp Met Leu Val Ser Met Ser Ser Ala Trp Glu Thr Ile Thr Ile
85 90 95

Tyr Leu Leu Asn Asn Lys His Leu Val Ile Ala Asp Ala Phe Val Arg
100 105 110

His Ile Asp Asn Val Phe Asp Ser Met Ile Cys Ile Ser Val Val Ala
115 120 125

Ser Met Cys Ser Leu Leu Ala Ile Ala Val Asp Arg Tyr Val Thr Ile
130 135 140

Phe Tyr Ala Leu Arg Tyr His His Ile Met Thr Ala Arg Arg Ser Gly
145 150 155 160

Ala Ile Ile Ala Gly Ile Trp Ala Phe Cys Thr Gly Cys Gly Ile Val
165 170 175

Phe Ile Leu Tyr Ser Glu Ser Thr Tyr Val Ile Leu Cys Leu Ile Ser
180 185 190

Met Phe Phe Ala Met Leu Phe Leu Leu Val Ser Leu Tyr Ile His Met
195 200 205

Phe Leu Leu Ala Arg Thr His Val Lys Arg Ile Ala Leu Cys Pro Gly
210 215 220

Pro Ala Leu Arg Gly Arg Gly Pro Ala Trp Gln Gly Ala Val Thr Val
225 230 235 240

Thr Met Leu Leu Gly Val Phe Thr Val Cys Trp Ala Pro Phe Phe Leu
245 250 255

His Leu Thr Leu Met Leu Ser Cys Pro Gln Asn Leu Tyr Cys Ser Arg
260 265 270

Phe Met Ser His Phe Asn Met Tyr Leu Ile Leu Ile Met Cys Asn Ser
275 280 285

Val Met Asp Pro Leu Ile Tyr Ala Phe Arg Ser Gln Glu Met Arg Lys
290 295 300

Thr Phe Lys Glu Ile Ile Cys Cys Arg Gly Phe Arg Ile Ala Cys Ser
305 310 315 320

Phe Pro Arg Arg Asp
325



(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO
GGAAGCTTTC TTTGGTAGGC TG



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
GGTCTAGAGC CACAGAGAGG AG 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
CTGCATTTCT TGGATCT 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



AAGCTGCACA TGGATGC



